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DETAILED ACTION 
Response to Arguments 

1 . Applicant's arguments witli respect to claims 1-20 have been considered but are 
moot in view of the new ground(s) of rejection. 

Claim Rejections - 35 USC § 102 

2. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 1 02 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the Invention was patented or described in a printed publication in this or a foreign country or in public 
use or on sale in this country, more than one year prior to the date of application for patent in the United 
States. 

3. Claims 1, 5, 11, and 12 are rejected under 35 U.S.C. 102(b) as being anticipated 
by Bantz et al. (U.S. Patent 5,987,405). 

In regard to claim 1 , Bantz et al. disclose a method for error detection within text 
transcribed from a first speech signal by an automatic speech-to-text transcription 
system (speech input 16 converted to text 12 by a speech recognizer 1 1 , column 3, 
lines 7-14) comprising synthesizing a second speech signal from the transcribed text 
(synthesizer 14 accepts the textual representation and synthesizes a speech signal 15, 
column 3, lines 14-17), providing first and second speech signal outputs for a 
comparison between first and second speech signals for an identification of potential 
errors in the text (input speech signal 16 and synthesized speech signal 15 are 
compared by differencing engine 17 to produce an error term associated with the text, 
column 3, lines 17-29). 
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In regard to claim 5, Bantz et al. disclose a comparison signal Is generated by 
subtracting or superimposing first and second speech signals (the comparison signal 18 
is a difference between the input speech signal 16 and synthesized speech 15, column 
4, line 64 to column 5, line 12). 

In regard to claim 1 1 , Bantz et al. disclose an error detection system for a 
speech-to-text transcription system providing a transcribed text from a first speech 
signal (speech input 16 converted to text 12 by a speech recognizer 11, column 3, lines 
7-14), the error detection system comprising: 

means for synthesizing a second speech signal from the transcribed text 
(synthesizer 14 accepts the textual representation and synthesizes a speech signal 15, 
column 3, lines 14-17), 

means for providing first and second speech signals for comparison between first 
and second speech signals for an identification of potential errors in the text (input 
speech signal 16 and synthesized speech signal 15 are compared by differencing 
engine 17 to produce an error term associated with the text, column 3, lines 17-29). 

In regard to claim 12, Bantz et al. disclose a comparison signal Is generated by 
means of subtracting or superimposing first and second speech signals (the comparison 
signal 18 is a difference between the input speech signal 16 and synthesized speech 
15, column 4, line 64 to column 5, line 12). 
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Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not Identically disclosed or described as set 
forth In section 102 of this title, if the differences between the subject matter sought to be patented and 
the phor art are such that the subject matter as a whole would have been obvious at the time the 
Invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner In which the Invention was made. 

5. Claims 2, 16, and 17 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Bantz et al., in view of Yamazaki (U.S. Patent 6,088,674). 

In regard to claim 2, while Bantz et al. emphasize the necessity of matching the 
input speech signal to the synthesized speech signal (column 4, lines 16-63), Bantz et 
al. do not disclose the speed and/or volume of the second speech signal matches the 
speed and/or volume of the first speech signal. 

Yamazaki et al. disclose a method for comparing a first speech signal to a 
second speech signal generated from the text transcribed from the first speech signal, 
wherein: 

the speed and/or the volume of the second speech signal matches the speed 
and/or the volume of the first speech signal (speech is input to a speech recognition 
section, column 28, lines 6-9; the transcription of which is then used to generate a 
synthetic speech signal, lines 10-27; the amplitudes of the two waveforms are then 
matched, lines 54-60). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Bantz et al. to match the speed and/or volume of second speech 
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signal to the speed and/or the volume of the first speech signal, because this would 
reduce the calculations needed for the elastic matching procedure of Bantz et al. 
(because the signals would be better matched). 

In regard to claim 16, Bantz et al. disclose a computer program product for error 
detection for a speech-to-text transcription system providing a transcribed text from a 
first speech signal (speech input 16 converted to text 12 by a speech recognizer 1 1 , 
column 3, lines 7-14), the computer program product comprising program means for: 

synthesizing a second speech signal from the transcribed text (synthesizer 14 
accepts the textual representation and synthesizes a speech signal 15, column 3, lines 
14-17), 

providing first and second speech signal outputs for a comparison between first 
and second speech signals (input speech signal 16 and synthesized speech signal 15 
are compared by differencing engine 17 to produce an error term associated with the 
text, column 3, lines 17-29) 

While Bantz et al. emphasize the necessity of matching the input speech signal 
to the synthesized speech signal (column 4, lines 16-63), Bantz et al. do not disclose 
the speed and/or volume of the second speech signal matches the speed and/or 
volume of the first speech signal. 

Yamazaki et al. disclose a method for comparing a first speech signal to a 
second speech signal generated from the text transcribed from the first speech signal, 
wherein: 
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matching the speed and/or the volume of the second speech signal to the speed 
and/or the volume of the first speech signal (speech is input to a speech recognition 
section, column 28, lines 6-9; the transcription of which is then used to generate a 
synthetic speech signal, lines 10-27; the amplitudes of the two waveforms are then 
matched, lines 54-60). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Bantz et al. to match the speed and/or volume of second speech 
signal to the speed and/or the volume of the first speech signal, because this would 
reduce the calculations needed for the elastic matching procedure of Bantz et al. 
(because the signals would be better matched). 

In regard to claim 17, Bantz et al. disclose a comparison signal is generated by 

means of subtracting or superimposing first and second speech signals (the comparison 
signal 18 is a difference between the input speech signal 16 and synthesized speech 
15, column 4, line 64 to column 5, line 12). 

6. Claim 3 is rejected under 35 U.S.C. 103(a) as being unpatentable over Bantz et 
al., in viewof Shimura (U.S. Patent 5,175,799). 

In regard to claim 3, while Bantz et al. emphasize the necessity of matching the 
input speech signal to the synthesized speech signal (column 4, lines 16-63), Bantz et 
al. do not disclose a set of filter functions is applied to the first speech signal to 
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approximate the spectrum of the first speech signal to the spectrum of the second 
speech signal. 

Shimura discloses a method of matching an input speech signal to a synthesized 
speech signal, wherein a set of filter functions is applied to a first speech signal to 
approximate the spectrum of the first speech signal to the spectrum of a second speech 
signal (speech is synthesized using a filter bank derived from an input speech signal, 
column 4, lines 40-59). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Bantz et al. to apply a set of filter functions to the first speech signal 
to approximate the spectrum of the first speech signal to the spectrum of the second 
speech signal, because this would reduce the calculations needed for the elastic 
matching procedure of Bantz et al. (because the signals would be better matched). 

7. Claim 4 is rejected under 35 U.S.C. 103(a) as being unpatentable over Bantz et 
al., in view of Hon et al. (U.S. Patent 6,490,563). 

In regard to claim 4, Bantz et al. do not disclose the second speech signal is 
generated by applying an inverse speech transcription process, generating a feature 
vector sequence from the text, using (a) statistical models of the speech-to-text 
transcription system and (b) a state sequence obtained in the process of transcription of 
the text from the first speech signal. 

Hon et al. disclose a method for error detection within transcribed text comprising 
generating a second speech signal by applying an inverse speech transcription process. 



Application/Control Number: 10/578,073 Page 8 

Art Unit: 2626 

generating a feature vector sequence from the text, using (a) statistical models of the 
speech-to-text transcription system and (b) a state sequence obtained in the process of 
transcription of the text from the first speech signal (HMM models are used to both 
generate the speech, column 6, lines 41-57; as well as recognize the input speech, 
column 8, lines 43-57). 

It would have been obvious to one of ordinary skill in the art at the time of 
invention to modify Bantz et al. to generate a second speech signal by applying an 
inverse speech transcription process, generating a feature vector sequence from the 
text, using (a) statistical models of the speech-to-text transcription system and (b) a 
state sequence obtained in the process of transcription of the text from the first speech 
signal, because this would reduce the calculations needed for the elastic matching 
procedure of Bantz et al. (because the signals would be better matched). 



Allowable Subject Matter 

8. Claims 6-1 0, 1 3-1 5, and 1 8-20 are objected to as being dependent upon a 
rejected base claim, but would be allowable if rewritten in independent form including all 

of the limitations of the base claim and any intervening claims. 

The following is a statement of reasons for the indication of allowable subject 
matter: 

In regard to claims 6, 13, and 18, Bantz et al., Yamakazi, Shimura, and Hon et al. 
do not disclose or suggest providing the comparison signal either acoustically or 
visually. 
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In regard to claims 7, 14, and 19, Bantz et a!., Yamakazi, Shimura, and Hon et al. 
do not disclose or suggest that an error signal is output when the comparison signal is 
beyond a predefined range. 

In regard to claims 9, 15, and 20, Bantz et al., Yamakazi, Shimura, and Hon et al. 
do not disclose or suggest detecting patterns in the comparison signal to detect a 
certain type of error in the transcribed text. 

Conclusion 

9. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to BRIAN L. ALBERTALLI whose telephone number is 
(571)272-7616. The examiner can normally be reached on Monday-Thursday, 8 AM to 
6:30 PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, David Hudspeth can be reached on (571) 272-7843. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/David R Hudspeth/ 

Supervisory Patent Examiner, Art Unit 2626 

BLA 11/20/08 



